Tone recognition of Chinese continuous speech using tone critical segments
Abstract
This paper presents our approach to automatically detecting tone nuclei and using their features to recognize the lexical tones of Chinese continuous speech. We have suggested that the fundamental frequency (F0) contour of a syllable usually consists of three segments: onset course, tone nucleus, and offset course. Of these, only the tone nucleus contains key features for tone discrimination, which makes it the tone-critical segment of the syllable. The other two segments result from the physiological transition effect of the human vocal cords and are largely affected by adjacent tones in continuous speech. The tone nucleus is detected by a two-process scheme: the first process segments a syllable F0 contour with the Segmental Clustering algorithm, and the second finds the tone nucleus according to knowledge rules on suprasegmental features. Tone recognition performance can be improved by using tone nucleus features and discarding the others. Tone recognition experiments showed the advantage of our method over conventional ones.
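The two-process scheme described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the Segmental Clustering algorithm or the knowledge rules, so the code below substitutes an exhaustive piecewise-linear segmentation for the first process and simply takes the middle segment as the nucleus for the second. The function names (`fit_line_sse`, `segment_three`, `nucleus_features`), the `min_len` parameter, and the choice of mean F0 and slope as features are all assumptions made for illustration.

```python
from statistics import mean


def fit_line_sse(ys):
    """Fit a least-squares line to ys over x = 0..n-1.

    Returns (sum of squared errors, slope).
    """
    n = len(ys)
    mx, my = (n - 1) / 2, mean(ys)
    sxx = sum((x - mx) ** 2 for x in range(n))
    sxy = sum((x - mx) * (y - my) for x, y in enumerate(ys))
    slope = sxy / sxx if sxx else 0.0
    sse = sum((y - (my + slope * (x - mx))) ** 2 for x, y in enumerate(ys))
    return sse, slope


def segment_three(f0, min_len=2):
    """Split f0 into three contiguous segments (assumed stand-in for
    Segmental Clustering): try every boundary pair (i, j) and keep the one
    with the smallest total line-fit error."""
    n = len(f0)
    if n < 3 * min_len:
        raise ValueError("contour too short for three segments")
    best = None
    for i in range(min_len, n - 2 * min_len + 1):
        for j in range(i + min_len, n - min_len + 1):
            cost = (fit_line_sse(f0[:i])[0]
                    + fit_line_sse(f0[i:j])[0]
                    + fit_line_sse(f0[j:])[0])
            if best is None or cost < best[0]:
                best = (cost, i, j)
    return best[1], best[2]


def nucleus_features(f0, i, j):
    """Assumed rule: treat the middle segment f0[i:j] as the tone nucleus
    and describe it by its mean F0, slope, and length in frames."""
    seg = f0[i:j]
    _, slope = fit_line_sse(seg)
    return {"mean_f0": mean(seg), "slope": slope, "length": j - i}


# Hypothetical F0 values in Hz: rising onset, level nucleus, falling offset.
contour = [100, 110, 120, 130, 130, 130, 130, 130, 120, 110, 100]
i, j = segment_three(contour)
features = nucleus_features(contour, i, j)
```

Only `features` would be passed to a tone classifier in this sketch. The onset and offset segments are discarded, which mirrors the abstract's claim that they mostly reflect transitions from adjacent tones.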
Similar resources
Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model
A method is developed for recognizing the lexical tone types of Standard Chinese syllables in continuous speech. A neural network (four-layered perceptron) is adopted as the classifier. The method includes two steps: first recognizing tone types using prosodic features of the voiced part, and then re-recognizing by looking only at the tone nucleus, which is a portion of the syllable showing rather stable fundame...
Subsyllabic Tone Units for Reducing Physiological Effects in Automatic Tone Recognition for Connected Mandarin
This paper presents our attempt to model the physiological transition effect on the syllable F0 contour in order to improve lexical tone recognition performance for Mandarin Chinese. We suggested that a syllable F0 contour consists of three segments: onset course, tone nucleus, and offset course. Among the three segments, only the tone nucleus contains key features for tone recognition, and the other two re...
Tone Recognition of Chinese Continuous Speech
This paper presents our approach to lexical tone recognition of Chinese continuous speech. The Mixed Gaussian Continuous Probability Model (MGCPM) [1] is used for tone modeling, and a quadratic curve is used to approximate the fundamental frequency (F0) contour; its three coefficients are calculated and taken as the features of the tone models. The tone variety in continuous Ch...
Study on tone classification of Chinese continuous speech in speech recognition system
In this paper, we first introduce the use of Gaussian mixture models (GMM) for Chinese tone classification in continuous speech. Then, we explain how to integrate it with an HMM-based speech recognition system. Finally, we report the tone classification accuracy of this probabilistic method, tested on the Chinese continuous speech database of the national "863" project.
A Study of Thai Tone Classification
Tone classification is a necessary part of Thai speech recognition. This paper presents work that builds on our earlier work described in refs. [1] and [2]. Several configurations of the tone classification front-end for a large-vocabulary Thai speech corpus are implemented and compared. They include the tone-critical segments, feature setting, frequency scale...